An introduction to gene expression deconvolution and the CellMix package A Comprehensive Framework for Gene Expression Deconvolution
نویسنده
چکیده
This vignette motivates and describes the functionalities of the CellMix package, an R package for performing gene expression deconvolution analysis. The package defines a general framework to apply, develop and test gene expression deconvolution methods. It incorporates, generalises and extends the set of tools we implemented when developing a semi-supervised approach to this problem, and includes several other previously published algorithms, hence facilitating their application and comparison. A special focus is drawn on lists of cell/tissue marker genes, which are very valuable resources as they may be used not only as prior knowledge or post-hoc independent validation data by deconvolution algorithms, but also as gene sets in classical enrichment analysis. We envisage that this package will provide the bioinformatics research community with an easy to use and flexible platform for working with deconvolution methods, and cell heterogeneity in omics data in general. This vignette aims at providing a general background on gene expression deconvolution, motivating the CellMix package, as well as describing its main features. It serves as supplementary material for the following article: Renaud Gaujoux et al. “CellMix: A Comprehensive Framework for Gene Expression Deconvolution”. In: accepted (2012) Documentation, practical examples, and sample analyses can be found online at: http://web.cbio.uct.ac.za/~renaud/CRAN/web/CellMix Note that to reproduce some of the examples in this vignettes, you will also need to have the GEOquery package package installed, as well as some Bioconductor annotation packages which will be installed when needed: # install biocLite install.packages("BiocInstaller") # or alternatively do: source(✬http://www.bioconductor.org/biocLite.R✬) # install GEOquery (NB: might ask you to update some of your packages) library(BiocInstaller) biocLite("GEOquery") http://www.bioconductor.org/packages/release/bioc/html/GEOquery.html In an interactive R session, their installation is automated, after the user gives permission to do so.
منابع مشابه
CellMix: a comprehensive toolbox for gene expression deconvolution
UNLABELLED Gene expression data are typically generated from heterogeneous biological samples that are composed of multiple cell or tissue types, in varying proportions, each contributing to global gene expression. This heterogeneity is a major confounder in standard analysis such as differential expression analysis, where differences in the relative proportions of the constituent cells may pre...
متن کاملCellMix : Sample Analyses
Gene expression experiments are commonly performed using samples that are composed of a variety of cell types, each contributing to the global gene expression measurements in different proportions. The CellMix package defines a general framework for gene expression deconvolution that integrates several of the currently available deconvolution algorithms, and facilitates the estimation of cell t...
متن کاملAssessment of deconvolution technique for diagnosis of hydronephrosis in infants (under 6 months) [persian]
Introduction: Hydronephrosis is one of the major causes of referring infants for nuclear medicine investigation. Renal dynamic study is the most reliable investigation for such children. However direct evaluation of renogram leads to generation of parameters that have no physiological interpretation. The configuration of renogram is very much dependent upon the rate at which radiopharmace...
متن کاملDeconRNASeq: a statistical framework for deconvolution of heterogeneous tissue samples based on mRNA-Seq data
SUMMARY For heterogeneous tissues, measurements of gene expression through mRNA-Seq data are confounded by relative proportions of cell types involved. In this note, we introduce an efficient pipeline: DeconRNASeq, an R package for deconvolution of heterogeneous tissues based on mRNA-Seq data. It adopts a globally optimized non-negative decomposition algorithm through quadratic programming for ...
متن کاملMultivariate Feature Extraction for Prediction of Future Gene Expression Profile
Introduction: The features of a cell can be extracted from its gene expression profile. If the gene expression profiles of future descendant cells are predicted, the features of the future cells are also predicted. The objective of this study was to design an artificial neural network to predict gene expression profiles of descendant cells that will be generated by division/differentiation of h...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013